Rank | Count | Beginning |
---|---|---|
23354 | 2412 | У |
13872 | 725 | На |
9534 | 653 | І |
5498 | 609 | Гэта |
1589 | 585 | Але |
93 | 570 | А |
28399 | 455 | Я |
7542 | 424 | Ен |
13179 | 408 | Мы |
16338 | 389 | Паводле |
10658 | 328 | Калі |
28741 | 316 | Як |
8230 | 282 | З |
130 | 264 | Аб |
16260 | 254 | Па |
22371 | 246 | Таму |
8297 | 242 | За |
1330 | 199 | Акрамя |
19421 | 191 | Прэзідэнт |
19014 | 190 | Пры |
18390 | 182 | Пра |
29457 | 182 | Яны |
27488 | 181 | Цяпер |
25894 | 160 | Усе |
7018 | 158 | Для |
15374 | 158 | Не |
17492 | 151 | Пасля |
28055 | 147 | Што |
2294 | 146 | Аляксандр |
6376 | 146 | Да |
The next four subsections show the most frequent sentence beginnings consisting of N words, for N = 1, 2, 3, 4. This subsection starts with N=1.
The most frequent word N-grams at the beginning of sentences give some insight into sentence composition.
For N=1 in particular, even a small corpus is enough to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
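The query above cuts each sentence at its first space and counts how often each resulting prefix occurs. A minimal Python sketch of the same counting, generalized to the first N words, might look as follows. It assumes the sentences are already available as a list of strings; the function name `sentence_beginnings` and the three sample sentences are invented for illustration and are not taken from the corpus.

```python
from collections import Counter


def sentence_beginnings(sentences, n=1, limit=50):
    """Return the `limit` most frequent n-word sentence beginnings.

    Splits on a single space, like substring_index(sentence, ' ', n):
    a sentence with fewer than n words is counted as a whole.
    """
    counts = Counter(" ".join(s.split(" ")[:n]) for s in sentences)
    # most_common sorts by descending count (ties keep first-seen order)
    return counts.most_common(limit)


# Toy input, made up for this example only.
sample = [
    "У лесе ціха.",
    "У горадзе дождж.",
    "На вуліцы снег.",
]

top = sentence_beginnings(sample, n=1)
for beginning, count in top:
    print(count, beginning)
```

Calling the same function with `n=2`, `n=3` or `n=4` gives the longer beginnings covered in the following subsections, which corresponds to changing the last argument of `substring_index` in the query.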